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An apparatus including a sampling circuit to generate sam- 
pling clocks from a local clock and the samphng clocks to 
sample incoming data and a quarter clock, a phase detector 
to detect a phase difference between a data transition in 
sampled data and the local clock, and a delay line adapted 
to delay the sampled data by the detected phase difference. 
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DELAY LOCKED LOOP BASED DATA on the characteristics of the particular substrate in which the 

RECOVERY CIRCUIT FOR DATA PLL is to be implemented. 

COMMUNICATION Being an analog device and having additional circuitry 

such as a re-timing circuit, data recovery circuits based on 
5 PLL are generally large using large areas of the silicon 

FIELD OF THE INVENTION substrate. Accordingly, it is desired to design a data com- 

The invention relates to delay locked loop-based circuits, munication circuit that does not require an analog circuitry 

and in particular to delay locked loop-based circuits in data ^^^^ ^^L and overcomes the shortcomings described 

communication. above. 

10 

BACKGROUND SUMMARY 

Circuitsusingphaselockedloop(PLL)arewidelyusedin accordance with the invention, there is disclosed an 

data communications. An example of such a circuit may be '''''^f'^f ^ sampbng circuit to generate samphng 

a data recovery circuit, A PLL based circuit may be generaUy , , blocks from a local clock and the samphng clocks to sample 

sufficient where power dissipation is not an issue even incoming data and a quarter clock, a phase detector to ^ 

though communication speeds are high. In certain circuits, ^ Pj^^^ difference between a data transition m sampled data 

the communication speeds may range from 1 MHz to 100 ^^Vf t f ' . Y Ime adapted to delay the 

GHz. In general, however, circuits operating at high speeds ^^°^P^^^ ^^^^ ^^^^^^^^ P^^^ difference, 

are sensitive to power dissipation that results in the over- 20 BRIEF DESCRIPTION OF THE DRAWINGS 
heating of the circuits. In circuits where power conservation 

is an issue, power dissipation is also problematic. accompanymg drawmgs are included to provide a 

^ I J .u J r J • J - further understanding of the invention, and are incorporated 

One commonly used method of reducmg power dissipa- . , , . r.L- -c ^ j ■ 

1 . • « 1 J » u * * in and constitute a part of this specification. The drawmgs 

tion IS to place the circuit m sleep mode when not in use. -n . . l j- . r . , , . . 

In the sleep mode, the circuit is temporarily shut down until 25 f^^""^^ embodiments of the mvention and, togeUier with 

*• . i L- L * 11 f J* « 1 » the description, serve to explam the pnnciples of the mven- 

reactivated which is generally referred to as wake up . For , .1 , • 

1 1. ° j.Lj. tion. In the drawings, 

example, when no transmission is bemg received, the data ^ 

recovery circuit places itself in a sleep mode. When a ^ ^ ^ schematic diagram of a Delay Locked Loop 

transmission is detected, the data recovery circuit wakes up C^^^) ^^^"^ ^^^^ recovery circuit in accordance with an 

to process the transmission. However, the PLL in the data 30 embodunent of the invention; 

recovery circuit generally requires a substantial acquisition FIG. 2 is a schematic diagram of an over-sampling block 

time, perhaps in the order of 94 to 2000 cycles before the and a phase detector in accordance with an embodiment of 

PLL can lock onto the incoming transmission. Therefore, to invention; 

allow for the PLL lock, synchronizing signals of sufficient FIG, 3 illustrates an over-sampling clock timing diagram 

duration are transmitted before the actual transmission is 35 in which the local reference clock is phase shifted 6 times by 

transmitted. However, such synchronizing signals introduce 1/6 T to form 6 equally-spaced clocks in one local reference 

latency to the overall data transfer speed, which is problem- clock cycle in accordance with an embodiment of the 

a tic in high-speed data communication. In one method, invention; 

latency due to PLL acquisition time is minimized by trans- FIG. 4 illustrates the case where a phase transition has 

mitting synchronizing signals at predetermined intervals to 40 been detected between sampling points el and e2, which 

the data recovery circuit during sleep mode thereby allowing will be communicated to the phase synthesizer in a form of 

the PLL to maintain lock. However, the transmiUer circuit signal Pj in accordance with an embodiment of the inven- 

and the data recovery circuit are not fully in the sleep mode tion; 

resulting in a constant power dissipation. 5 ^ ^ tjo^k diagram of a phase synthesizer in 

The PLL also has poor jitter tolerance in that being an 45 accordance with an embodiment of the invention; 

analog device, which uses a voltage-controlled oscillator, pic. 6 illustrates a sigma-delta modulator in accordance 

the PLL mherenUy generates noise. Furthermore, the PLL of ^n embodiment of the invention; 

the data recovery circuit syncs with the transmitting clock to pj^, 7 illustrates a Shift Register Accumulator (SRA) in 

recover data. However, because the recovered data ^ not in .^.^ance with an embodiment of the invention; 

sync with the recovery circuit clock, the data needs to be 50 r.,^ o .„ .... ^ ^ . - 

resynced using an additional re-timing circuit. As an addi- ^ F G. 8 mms two logic diagrams of a De 

tional problem, being an analog circuit the PLL -based data Subtracter (DMS), the first where N>3 and the second where 

recovery circuit is rather complicated and cannot be easily ° ' 

implemented in a CMOS fabrication process. Therefore, F^^. 9 illustrates a Decision Matrix Subtracter (DMS), 

generaUy, two fabrication processes are performed. First, a 55 ^^^^^ ^he no operation (NOP) commands are represented by 

CMOS fabrication process is performed to implement the "^'"s and the shift-right (SR) and shift-left (SL) commands 

digital circuits in the substrate. This is followed by a bipolar ^re represented by AND and OR gates; 

fabrication process to implement the analog circuits in the FIG. 10 illustrates the example where the AND gate 

substrate. However, using two processes complicates the detects a shift-left (SL) operation and a borrow signal is in 

manufacturing process and is generally not cost efficient. In 60 transmitted to the multi-cycle phase (MCP) register in 

addition, PLL-based circuits are highly susceptible to the accordance with an embodiment of the invention; 

stray resistance and capacitance of the substrate. In many FIG. 11 illustrates the example where the AND gate 

instances, the effects of the stray resistance and capacitance detects a shift-right (SR) operation and a carry signal is in 

of the substrate are so severe that the characteristics of the turn transmitted to the multi-cycle phase (MCP) register in 

original PLL design are altered. Thus, the PLL does not 65 accordance with an embodiment of the invention; 

perform according to its specification. Due to this problem, FIG. 12 is a schematic diagram of a delay line in accor- 

the PLL design may not be readily scalable and may depend dance with an embodiment of the invention; 
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FIG. 13 is a schematic diagram that illustrates two general may be stored in an N-bit in -cycle phase (ICP) error register 

approaches to synchronizing a transmitting clock in a trans- and an M-bit multi-cycle phase (MCP) error register. This 

milting circuit and a receiving clock in a receiving circuit as allows DDL 25 to have a tracking range of M clock cycles 

to allow transfer between the two circuits in accordance with with a 1/N clock cycle resolution. 

an embodiment of the invention; 5 According to one embodiment, DDL 25 comprises an 

RG. 14 is a schematic diagram of a system wherein a ^^2,± ^^S^*^ first-in-first^ut 

peripheral controller comprises a data recovery circuit and is (^^O) agister with fixed output port. The fine or in-cycle 

coupled to a processor that is adapted to access data from the P*^^^ error of the incommg data is compensaled by se ecting 

- u 1 . 11 ■ J i_ J- * f the corresponding N re-timed data from the output of the OS 

peripheral controller m accordance with an embodiment of i . f . i , 

f. . in block 10 usmg the MUX accordmg to the value of the ICP 

the invention. ■^^ ■ . i*- i j . i. i_ 

register. 1 ne coarse or multi-cycle data phase error may be 

DETAILED DESCRIPTION compensated by adjusting the length of the FIFO according 

to the value of the MCP register. Further details of the above 

When used in a data recovery scheme, in one scheme will now be described in more detail in the passages 

embodiment, the data recovery circuit uses a controlled below. 

digital delay line to adaptively compensate for incoming piQ 2 is a schematic diagram of over-sampling (OS) 
data phase error using an over-sampUng Sigma delta modu- ^lock 30 and phase detector 45 in accordance with an 
lation algonthm. Such configuration ensures the phase of the embodiment of the invention. The incoming data is over- 
data at the output of the delay Ime to be in sync with the data ^^^pi^d using N (N>2) phase clocks. FIG. 3 illustrates an 
recovery circuit clock (i.e., local reference clock as opposed over-sampling clock timing diagram in which the local 
to the transmitting clock). Thus, a DLL-based data recovery reference clock (e.g., the data recovery circuit clock) is 
circuit does not require additional data sync operation as in phase shifted 6 times by 1/6 T (where T is one clock cycle 
the PLL-based scheme. In this manner, a DLL-based data jo^al reference clock) to form 6 clocks (i.e., N=6) in 
recovery circuit achieves very short acquisition lime when one local reference clock cycle. One of the edges (e.g., rising 
compared to the acquisition time of the PLL. Further, a edge)of the clocks are used as sampUng points el ... e2 and 
DLL-based data recovery circuit has relatively short latency, sample the incoming data. During each of its clocks 
which may be as short as 4-10 clock cycles, including ^^^^^^ incoming data are taken to provide the 
elastic buffermg. Further, the data recovery scheme is highly over-sampling of incoming data. Because N (in this 
jitter tolerant, -^lus^ from above, it can be seen that these .^^mple, N.6) sampling points correspond to a single local 
features make the DLL data recovery scheme very suitable reference clock cycle, the sampled data is re-timed to the 
for various low power and high-speed applications. Iq^^I reference clock. Referring back to FIG, 2, the phase 
The scheme may be implemented using a digital-based shift into N clocks may be performed by delay line 35 or 
analog (DBA) design approach, which utilizes analog func- other equivalents. The N clocks are used as triggers to the 
tions using digital circuits based on certain digital signal respective N number of flip-flops 40. Each flip-flop is 
processing (DSP) algorithms. The DBA approach makes the 35 triggered by the rising edge of one of the N clocks. The 
circuit implementation highly scalable and allows the circuit incoming data line is coupled to the input of the flip-flops 40 
to be directly integrated onto a digital-based chip without which is sampled by the flip flops to be outputted as sampled 
degrading its rehability, manufacHirability and testability. data dl, d2, . . , dN. 

Various embodiments will be described to aid in the under- Sampled data dl, d2, . . . dN is used by phase detector 45 

statiding of the irivention and should not be construed as 4^ to determine the phase transition of the incoming data, 

limitations to the invention. According to one embodiment, phase detector 45 comprises 

FIG. 1 is a schematic diagram of a Delay Locked Loop a plurality of Exclusive-OR (XOR) gates. An XOR function 

(DLL) based data recovery circuit 5 in accordance with an is performed on one sampled data and the immediate 

embodiment of the invention. As shown in the figure, the sampled data. If there is a data transition between the two 

data recovery circuit comprises over-sampling (OS) block 45 sampled data, the corresponding XOR gate outputs a signal 

10, phase detector 15, phase synthesizer 20 and delay line to acknowledge the data transition. For example, as shown 

25. While delay line 25 may be formed using analog in FIG. 4, between the sampling points el and e2 a phase 

components, in a digital circuit-based fabrication process, it transition has been detected, which will be communicated to 

is desirable to use a digital-based delay line. The figure the phase synthesizer in a form of signal p2. 

illustrates a digital delay line (DDL) 25. To aid in the 50 FIG. 5 is a block diagram of phase synthesizer 50 in 

understanding of the invention, an overview of the embodi- accordance with:an embodiment of the invention. According 

ment is given below. to the embodiment, the phase synthesizer 50 synthesizes the 

OS block 10 over-samples the incoming digital data incoming data phase error from the detected phase transition 

stream (shown as Rx signal input) using N (N^2) phase (from the phase detector) and the previously stored data 

clocks. The N sampled data are re-aligned to the local 55 phase error (in the shift registers to be described below) 

reference clock to provide N re-timed parallel data, which is based on a digital sigma-delta modulator algorithm. FIG, 6 

the same incoming data with (T/N) delays in the clock cycle illustrates a sigma-delta modulator 55 in accordance with an 

(T). The phase error of the detected data transition (i.e., the embodiment of the invention. Sigma-delta modulator 55 

phase difference between the data transition and the local according to one embodiment comprises Shift Register 

reference clock) is determined from the over-sampled data. 60 Accumulator SRA 60 and Decision MaUix Subtractor DMS 

In this figure, phase detector 15 is shown to detect the data 65. FIG. 7 illustrates Shift Register Accumulator SRA 60 in 

transition. The re-timed data and the phase error are then fed accordance with an embodiment of the invention. SRA 60 

into the data and control path of data recovery circuit 5. comprises two shift registers 70, 75, that are used as counters 

Phase synthesizer 20 synthesizes the incoming data phase to store the values of the synthesized in-cycle phase (ICP) 

error from the transition phase error and the previous data 65 and multi-cycle phase (MCP). The ICP shift register 70 

phase error based on a sigma delta modulation algorithm. comprises N bits which track phase error within a cycle of 

The synthesized in-cycle and multi-cycle data phase errors the local clock, for example. According to the embodiment. 
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the N bits correspond to the number of sampling points used sented by "0"'s and the shift-right (SR) and shift-left (SL) 

in the OS block. The number of sampling points and N bits commands are represented by AND and OR gates. Thus no 

of ICP shift register 70 may be selected according to a operation will be performed in the ICP register 70, since 

desired result, however, the greater the value N, a more both signals are said to be in phase or synchronized as no 

refined mning of the in-cycle phase error may be performed. 5 transition was detected. The shift-left (SL) operations can be 

MCP shift register 75 comprises M bits, which track the represented and performed using AND gates connected to 

phase error according to a number of clock cycles. Stated another, where an OR gate is finally connected to its 

differently, MCP shift register 75 determines the phase error ^ ^^.f, ^ ^ ^^^^ ^1,;^.,^^, 

m M clack cycles. MCPshifl register 75 and ICP shift clock ^^L) operation is detected, the enabling bit of the previous 

register 70 allows the Delay Locked Ixx)p (DLL) to have a ^^^^^ ^^^^ ,(,p 70 wiU be shifted to the left until a 

phase error tracking range of M clock qrcl^ with 1/N clock ^^^^^^ generated, so that it may be in 

''^f J^o 11 r- ' "Per^^ons of ICP sMt reg^ter 70 „^ synchronized with the signal obtained from the 

and MCP shift register 75 are performed by Shift Right (SR) ^ ^ detector 

operation for phase lag and Shift Left (SL) operation for ^..^-^ ■ ^ 

phase lead. The SR and the SL operations are determined by Shift-right (SR) operations can be represented and 

a DMS. which will be described with reference to FIG. 8. " P«Eonned, agam using AND gates connected to one another, 

-me operation of ICP register 70 and MCP register 75 are *° °^ 8"'^ finally connected to i^ output. In such 

now described. According to one embodiment when the ' f^'^'-flt" ^'°IV (SR) operaUon 

data recovery circuit is powered up, perhaps through a wake ''.^f *='^^' "if, ^" °f 'f ! ^''u^^ 

up mode, ICP register 70 and MCP register 75 lore pre- 't^'^'" ^3 ^ .^'"f'^. ° ^ ngh«. that it may be m 

defined values that may be approximately in phase with the P^'^ °' synchronized with the signal obtamed from the 

Onisc detector 

incoming data. Dependent on the SR and SL commands ^ 

received from the DMS, ICP register 70 shifts an enabling ^ ^° example, if the detected transition phase is p2 and 

bit to the right or left respectively. The enabUng bit enables previously stored bit in ICP shift register 70 is si, the 

one of the signals, SI, S2 , . . SN. ITie enabled signal SI, S2 25 transmits an SR signal to ICP shift register 70. ICP 

. . . SN of ICP register 70 determines the delay duration at ^hifi register 70 shifts right one bit or s2. If the transition 

the delay line as will be apparent with respect to FIG. 9. If P^^^^e remains at p2, then no signal is generated by DMS 

ICP register 70 reaches one end (i.e., the least significant bit because the delected transition phase p2 and the synthesized 

(LSB), or the most significant bit (MSB), of the register in-cycle phase s2 are in sync. Similarly, if the detected 

without syncing with the incoming data, an overflow (e.g., 30 transition phase is p2 and the synthesized in-cycle phase is 

carry) or an underflow (e .g., borrow) is transmitted to MCCP ^N, DMS transmits a SR to ICP shift register 70. Because the 

register 75. bit sN is the MSB in the ICP shift register 70, a carry signal 

If MCP register 75 receives a carry signal, MCP register ^ transmitted to MCP shift register 74, which increments 

75 increments by one bit to indicate that the incoming data *° "^^'^^^"^ ^ ^y^^^* ^hift register 70 

has one cycle lead over the local clock. Conversely, if MCP 35 fe-counts as si. If the detected transition phase is at p2, 

register 75 receives a borrow signal from ICP register 70, ^^^"^^ "^^y^^^ P^^^ ^ ^1, DMS transmits a SR 

MCP register 75 decrements by one bit to indicate that the ^^6°^^ shift register 70. The ICP shift register 

incoming data has one cycle lag over the local clock. The increments one bit to reflect s2. If the detected transition 

enabUng bit of MCP register 75 enables one of the signals P*^^^^ ^ P^' °o ^'^^"^ operation is transmitted by DMS 

tl, t2 . - . tM that determines a delay duration of the delay 40 ^^^^^^^ ^^^^^^^'^ transition phase and the synthesized 

line as will be apparent in FIG. 12. Once MCCP register 75 m-cycle phase are m sync. 

has incremented or decremented by one bit, ICP register 70 1° the preceding DMS matrix, when x=l and y«N, a carry 

shifts the enabling bit again, according to the SR and SL signal is found. Also in the preceding DMS matrix, when 

signals. The process resumes until SRA 70 syncs with the x«N and y«l, a borrow signal is found. Both an overflow 

incoming data. 45 (e g-, carry) or an underflow (e.g., borrow) can be repre- 

FIG. 8 is a logical diagram of a decision matrix subtractor by AND gates. For example, as shown in FIG. 10, in 

DMS 80 for N>3 and DMS 85 for N=3, in accordance with ^ borrow signal, if the previous state was 100 and 

one embodiment of the invention. DMS 80 contains possible °^xt state was 001. the AND gate would detect a 

combinations obtained by the intersection of a detected shift-left operation and a borrow signal would in tuna be 

transition phase (i.e., pi, p2, p3 . . . pN) and a synthesized 50 transmitted to MCP register 75. Also, for example, as shown 

in-cycle phase (i.e., si, s2, s3 . . . sN) and affects the ^ ^ case of a carry signal if the previous state 

previously stored value in ICP shift register 70 (see FIG. 7). ^be next state was 100, the AND gate would 

As iUustrated, there are a plurality of operations designated ^^^^^ ^ shift-right operation and a carry signal would in turn 

as no operations (NOP), shift right (SR), and shift left (SL) be transmitted to MCP register 75. 

stored within the matrix. The ICP shift register 70 operates 55 It should also be noted that in the preceding DMS matrix, 

dependent on the selected operation. The number of rows in when x=l and y=N, up to 2 operations can be performed, 

the N-dimensional matrix are determined by the signals that is a shift-right (SR) for FDDL 95 and a shiftright (SR) 

obtained from the phase detector (P^Pj • • • Pjv)- The number for CDDL 100-if there is a carry signal. Thus, 2 bits of data 

of columns in the n-dimensional matrix are determined by may be transmitted to MCP register 75, As well, when x=N 

the enabled signals obtained from the enabling bit in ICP 60 y°l» tip to 2 operations can be performed, that is a 

register 70, in the previous state. Because a 3-dimensional shift-left (SL) for FDDL and a shift-left (SL) for CDDL if 

matrix involves a special case, it can be represented by DMS there is a borrow signal. Thus, two bits of data may be 

85. For all other cases, where N>3, DMS 80 can be used, transmitted to MCP register 75. 

Denoting the subscript for P signals as X(P;r) and for S FIG. 12 is a schematic diagram of a delay line in accor- 

signals as Y(Sy), it is possible to fill in the matrix with either 65 dance with an embodiment of the invention. In this figure, a 

3 different operations (SR, SL or NOP), FIG. 9 illustrates a Digital Delay Line PDL) is contemplated although an 

DMS where the no operation (NOP) commands are repre- Analog Delay Line may be used according to a desired 
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result. DDL 90 comprises Fine Digital Delay Line FDDL 95 
and Coarse Digital Delay Line CDDL 100. The delay 
duration of FDDL 95 is controlled by ICP shift register 70 
(see FIG. 7) by signals si, s2 . . . sN. Conversely, the delay 
duration of CDDL 100 is controlled by the MCP shift 5 
register 75 by signals tl, t2 . . . IN. FDDL 95 compensates 
the phase difference between the transition and the local 
clock that occurs within the local clock cycle. CDDL 100 
compensates the phase difference between the transition and 
the local clock where the difference is a multiple of local lo 
clock cycles. 

Referring to the figure, in one embodiment, FDDL 95 is 
a N-to-1 digital MUX thai selects one of the data stream dl, 
d2 . . . dN according to one of the signals, si, s2 . . . sN 
enabled by ICP register 70. In that manner, ICP register 70 
selects the optimal data stream that has been compensated 
for the phase error. CDDL 100 according to one 
embodiment, is a first-in-first -out register (FIFO), which 
may be formed by a plurality of flip-flops (FFs). Each 
enabled flip-flop delays the data stream by one local clock 20 
cycle. The number of local clock cycle delays is determined 
by tl, t2 . . . tM signals enabled by MCP register 75. Thus, 
if ICP shift register 70 has N bits, and the MCP shift register 
75 has M bits, DDL 90 wiU, generally, be designed to have 
a phase error tracking range of M clock cycles with 1/N 25 
clock cycle resolution. DDL 90 generally compensates the 
phase difference by delaying the incoming data until the 
incoming data is in sync with the local reference clock. 

FIG. 13 is a schematic diagram that iUustrates two general 
approaches to synchronizing a transmitting clock in a trans- '^^ 
milting circuit, and a receiving clock in recovery circuit 105, 
as to allow data transfer between the two circuits. In one 
approach, the local clock may be variably delayed by 
variable delay device 110 until the local clock is in sync with 
the incoming data (R^ signal input). This approach is gen- 
erally used by PLL-based schemes where the variable delay 
device 110 is a voltage-controlled oscillator. In the second 
approach, the incoming data is variably delayed by variable 
delay device 115 until the incoming data is in sync with the 
local clock. This approach is generally used by DLL-based 
scheme where variable delay device 115 is on delay line 
such as DDL 90 described in FIG. 12. 

FIG. 14 is a schematic diagram that iUustrates a system 
120 wherein a peripheral controller 140 comprises a data 
recovery circuit 145. Peripheral controller 140 is coupled to 
processor 130 via a serial or parallel bus 135. Processor 130 
is adapted to access data from peripheral controller 140 via 
bus 135. 

Memory 125, and display controller 150, may also be 
coupled to peripheral controller 140 via bus 135. Monitor 
155 may also be coupled to display conU-oller 150. And, 
other peripheral devices 160, such as a mouse, CD-ROM 
and video, may also be coupled to peripheral controller 140. 

FIG. 14 illustrates but one application of the invention, 55 
that is the personal computer but may be replaced by other 
applications such as a workstation, server, Internet driver or 
other fabric channels used as a link. 

The invention has been described with respect to various 
embodiments. The invention aUows for data recovery with 60 
various over-sampling rates and multi-level digital signal 
inputs. The multi-level digital data sampling capability may 
be used for multi-level digital data communication and 
equalization. According to one embodiment, this may be 
implemented using a programmable phase synthesizer 65 
implemented as a programmable SRA and a programmable 
DMS. The invention aUows for data communication circuits 
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to be designed using DLL instead of PLL. Circuit imple- 
mentation of the described data recovery scheme above has 
yielded a size of less than 25% and lower power dissipation 
when compared with a conventional PLL-based scheme. 
When a digital DLL is used, a single digital fabrication 
process may be used because the circuit to be fabricated is 
mainly digital, better testability, scalability, and manufac- 
turability may be obtained. 

In the preceding detailed description, the invention is 
described with reference to specific embodiments thereof. It 
will, however, be evident that various modifications and 
changes may be made thereto without departing fi-om the 
broader spirit and scope of the invention as set forth in the 
claims. The specification and drawings are, accordingly, to 
be regarded in an illustrative rather than a restrictive sense. 

What is claimed is: 

1. A system comprising: 
a processor; and 

a peripheral controller coupled to the processor, the 
processor adapted to access data fi-om the peripheral 
controller, the peripheral controller comprising a data 
recovery circuit, the data recovery circuit having a 
sampling circuit to generate sampling clocks from a 
local clock of the system, the sampling clocks to 
sample incoming data, a phase detector to detect a data 
transition in the sampled data, a phase synthesizer to 
generate a predicted phase based on the detected data 
transition, and a delay line to delay the sampled data in 
accordance with the predicted phase. 

2. The system of claim 1, wherein the phase detector 
comprises a plurality of Exclusive-OR (XOR) gates, lo 
perform a fiinction on a plurality of sampled data and lo 
provide a signal that acknowledges a data transition between 
two sampled data. 

3. The system of claim 1, wherein the sampling circuit is 
an oversampling circuit lo oversample incoming data using 
N (N>«2) clocks and the delay between two clocks is T/N 
where T is a local clock cycle and N is a real number. 

4. A data recovery circuit comprising: 

an oversampling block to generate a plurality of sampling 
signals from a local reference clock, to sample an 
incoming data signal that contains a plurality of data 
symbols; 

a phase detector having an input coupled to an output of 
the oversampling block, the detector to detect a plural- 
ity of transitions of the serial data signal based on a 
plurality of samples of the incoming serial data signal; 

a phase synthesizer having an input coupled to an output 
of the phase detector, the synthesizer to predict a phase 
of each of the plurality of data symbols in the incoming 
data signal; and 

a variable delay line having a control input coupled to an 
output of the phase synthesizer, a further input coupled 
lo an output of the oversampling block, and an output 
to provide the plurality of data symbols through a 
variable delay, 

wherein the synthesizer is to control the delay line in 
accordance with the predicted phase. 

5. The data recovery circuit of claim 4 wherein the 
synthesizer is to store in-cycle phase and mtilti-cycle phase 
information, the in-cycle phase information refers lo phase 
errors, between the serial data signal and the predicted 
phase, that are shorter than a single period of the local clock, 
and the multi-cycle phase information refers to phase errors 
that are longer than a single period of the local clock. 

6. The data recovery circuit of claim 5 wherein the 
variable delay line has a coarse delay portion and a fine 
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delay portion, the coarse delay portion to be adjusted accord- 
ing to the multi-cycle phase information and the fine delay 
portion to be adjusted according to the in-cycle phase 
information. 

7. The data recovery circuit of claim 4 wherein the 5 
variable delay line has an input that is coupled to the 
oversampling block to receive the plurality of samples of the 
incoming serial data signal. 

8. The data recovery circuit of claim 4 wherein the phase 
detector is to detect a transition between adjacent samples of lO 
the incoming serial data signal. 

9. The data recovery circuit of claim 8 wherein the 
oversampling block is to sample at a rate that is N times the 
frequency of the local clock, where N is greater than one. 

10. The data recovery circuit of claim 4 wherein the phase i5 
synthesizer comprises a decision matrix subtracter (DMS) 
having inputs to receive a detected phase and a predicted 
phase, and a shift register accumulator (SRA). 

11. The data recovery circuit of claim 10 wherein the 
DMS is lo generate a predetermined command, being one of 20 
shift right, shift left, and no operation, to the SRA, the SRA 
having a first shift register to contain a first bit which 
controls the amount of delay in the delay line. 

12. The data recovery circuit of claim 11 wherein the SRA 
has a second shift register, and wherein an underflow or an 25 
overflow in the first register causes a second bit of the 
second register to be shifted. 

13. The data recovery circmt of claim 12 wherein the 
delay line comprises: 

a first delay line and a second delay line, wherein the first 
delay line is to delay the sampled data up to one local 
clock cycle and the second delay line is to delay the 
sampled data by more than one local clock cycle, the 
first shift register is to control the delay of the first delay 
line and the second shift register to control the delay of 
the second delay line. 

14. The data recovery circuit of claim 13 wherein the first 
delay line is an N-to-1 MUX that selects one of a plurality 
of sampled data dl, d2 . . . dN according to one of signals, 
SI, S2 . . . SN enabled by the first shift register. 40 

15. The data recovery circuit of claim 14 wherein the 
second delay line is a plurality of flip-flops (FFs), wherein 
each flip-flop when enabled delays the plurality of sampled 
data by one local clock cycle and the number of local clock 
cycle delays is determined by signals Tl, T2 . . . TM enabled 45 
by the second register. 

16. A method for recovering data, comprising: 
detecting transitions in an incoming serial data signal that 

contains a sequence of data symbols, by oversampling 
the incoming serial data signal based on a local refer- 
ence clock signal; 
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determining a phase error between a detected transition of 
the incoming serial data signal and a predicted phase of 
the incoming serial data signal wherein the phase error 
is determined using a decision matrix of mxm 
elements, wherein m is an integer greater than one, 
which contain combinations of detected transition 
phase and predicted phase; and 

changing a timing of a sequence of recovered data sym- 
bols according to the phase error. 

17. The method of claim 16 wherein the oversampling is 
at a rate that is an integer multiple of a frequency of the local 
reference clock signal. 

18. The method of claim 17 ftirther comprising generating 
the predicted phase of the incoming serial data signal using 
a sigma-delta modulation algorithm. 

19. A method for data recovery comprising: 
sampling a data signal based on a local reference clock, to 

obtain a plurality of samples of each symbol period in 
the data signal; 

detecting a data transition in the data signal using the 
plurality of samples; 

synthesizing a phase based on the detected data transition 
using a sigma-delta modulation algorithm implemented 
by (a) a decision matrix subtractor (DMS) that operates 
based on the data transition as an input, and (b) a shift 
register accumulator (SRA) controlled by the DMS and 
whose output is used to derive the phase; and 

delaying the plurality of samples in accordance with the 
synthesized phase. 

20. The method of claim 19 wherein the data transition is 
detected using a plurality of sampling signals derived from 
the local reference clock. 

21. A method for data recovery comprising: 
sampling a data signal based on a local reference clock, to 

obtain a plurality of samples of each symbol period in 
the data signal; 

detecting a data transition in the data signal using the 
plurality of samples; 

synthesizing a phase based on the detected data transition 
wherein the phase is synthesized by configuring a shift 
register with an output bit pattern that will represent the 
synthesized phase, and applying a shift left, a shift 
right, and don't shift commands to the shift register 
based on a plurality of combinations of a detected data 
transition and a synthesized phase; and 

delaying the plurality of samples in accordance with the 
synthesized phase by applying the output bit pattern of 
the shift register to a delay line. 

* ♦ * * * 
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